WikiMatch results for OEAI 2012

نویسندگان

  • Sven Hertling
  • Heiko Paulheim
چکیده

WikiMatch is a matching tool which makes use of Wikipedia as an external knowledge resource. The overall idea is to search Wikipedia for a given concept and retrieve all pages describing the term. If there is a large amount of common pages for two terms, then the concepts will have similar semantics. We make also use of the inter-language links between Wikipedias in different languages to match multilingual ontologies. The results show that this simple idea can keep up with state of the art tools. Moreover, the results on the Multifarm track depend on the Wikipedia’s number of articles as well as the link amount to the Wikipedia of the other natural language to match. The growth of Wikipedia will thus help this matcher to improve the matching quality. 1 Presentation of the system 1.1 State, purpose, general statement WikiMatch is an element-level ontology matching tool. It uses Wikipedia as a huge background knowledge to find out, how similar two concepts are. The algorithm extracts all labels, comments, and URI fragments, and uses Wikipedia’s search function to retrieve an set of articles related to that term. If the intersection between such two sets is high, then we assume that the terms have something in common and are related to each other. To also deal with multilingual ontologies, all language links of the returned articles are requested as a second step. For each language, the Jaccard coefficient of the two sets of articles retrieved is computed, as equation (1) shows. sim(t1, t2) := maxti∈{label(ci),fragment(ci),comment(ci)},i∈{1,2} #(S(t1) ∩ S(t2)) #(S(t1) ∪ S(t2)) (1) For the terms t1 and t2 the resulting article set S is computed. The maximum over all labels, comments and URI fragments are then the similarity measure for these terms. If Wikipedia returns an suggestion for the term, a new query is made with this new search term. This is typically the case when entering a misspelled term in the search. An overview of the entire system is shown in Fig. 1. 1.2 Specific techniques used Our first test was to search for the whole term in Wikipedia. We call this approach simple search. As a result the precision is high in contrast to the recall which is very low. To

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A replication study: understanding what drives the performance in WikiMatch

We replicate and demonstrate that the performance of the WikiMatch automated ontology alignment system may be driven not by the particular information from Wikipedia directly used by the system, but rather by string similarity and Wikipedia’s manually curated synonym sets, as encoded in the site’s query resolution and page redirection system. In order to gain a detailed understanding of how Wik...

متن کامل

WikiMatch - using Wikipedia for ontology matching

Finding correspondences between different ontologies is a crucial task in the Semantic Web. Ontology matching tools are capable of solving that task in an automated manner, some even dealing with ontologies in different natural languages. Most state of the art matching tools use internal element and structure based techniques, while the use of large-scale external knowledge resources, especiall...

متن کامل

WikiV3 results for OAEI 2017

WikiV3 is the successor of WikiMatch (participated in OAEI 2012 and 2013) which explores Wikipedia as one external knowledgebase for ontology matching. The results show that the matcher is slightly better than matchers based on string equality and can get higher recall values. Moreover due to the construction of the system it is able to compute mappings in a multilingual setup. 1 Presentation o...

متن کامل

Variations on aligning linked open data ontologies

Traditional OA systems are not as suitable for aligning LOD ontology schemas; for example, equivalence relations are limited among LOD concepts so that OA systems for LOD ontology alignment also find subclass and superclass relations. Four recent approaches for LOD ontology alignment are BLOOMS (BL) [1] and BLOOMS+ [2], AgreementMaker (AM) [3], WikiMatch (WM) [4], and Holistic Concept Mapping (...

متن کامل

WeSeE-Match results for OEAI 2012

WeSeE-Match is a simple, element-based ontology matching tool. Its basic technique is invoking a web search engine request for each concept and determining element similarity based on the similarity of the search results obtained. Multi-lingual ontologies are translated using a standard web based translation service. The results show that the approach, despite its simplicity, is competitive wit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012